Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 25437 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Total size in memory | 3.6 MiB |
| Average record size in memory | 148.0 B |
Variable types
| Numeric | 7 |
|---|---|
| Text | 15 |
Unnamed: 0 has unique values | Unique |
key_id has unique values | Unique |
home_team has 12718 (50.0%) zeros | Zeros |
away_team has 12719 (50.0%) zeros | Zeros |
starter has 4515 (17.7%) zeros | Zeros |
substitute has 20922 (82.3%) zeros | Zeros |
Reproduction
| Analysis started | 2023-10-23 21:38:48.585120 |
|---|---|
| Analysis finished | 2023-10-23 21:38:49.591310 |
| Duration | 1.01 second |
| Software version | ydata-profiling vv4.6.0 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ)
UNIQUE 
| Distinct | 25437 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12718 |
| Minimum | 0 |
|---|---|
| Maximum | 25436 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1271.8 |
| Q1 | 6359 |
| median | 12718 |
| Q3 | 19077 |
| 95-th percentile | 24164.2 |
| Maximum | 25436 |
| Range | 25436 |
| Interquartile range (IQR) | 12718 |
Descriptive statistics
| Standard deviation | 7343.173735 |
|---|---|
| Coefficient of variation (CV) | 0.5773843163 |
| Kurtosis | -1.2 |
| Mean | 12718 |
| Median Absolute Deviation (MAD) | 6359 |
| Skewness | 0 |
| Sum | 323507766 |
| Variance | 53922200.5 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 16967 | 1 | < 0.1% |
| 16965 | 1 | < 0.1% |
| 16964 | 1 | < 0.1% |
| 16963 | 1 | < 0.1% |
| 16962 | 1 | < 0.1% |
| 16961 | 1 | < 0.1% |
| 16960 | 1 | < 0.1% |
| 16959 | 1 | < 0.1% |
| 16958 | 1 | < 0.1% |
| Other values (25427) | 25427 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 |
| Value | Count | Frequency (%) |
| 25436 | 1 | |
| 25435 | 1 | |
| 25434 | 1 | |
| 25433 | 1 | |
| 25432 | 1 |
key_id
Real number (ℝ)
UNIQUE 
| Distinct | 25437 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12719 |
| Minimum | 1 |
|---|---|
| Maximum | 25437 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1272.8 |
| Q1 | 6360 |
| median | 12719 |
| Q3 | 19078 |
| 95-th percentile | 24165.2 |
| Maximum | 25437 |
| Range | 25436 |
| Interquartile range (IQR) | 12718 |
Descriptive statistics
| Standard deviation | 7343.173735 |
|---|---|
| Coefficient of variation (CV) | 0.5773389209 |
| Kurtosis | -1.2 |
| Mean | 12719 |
| Median Absolute Deviation (MAD) | 6359 |
| Skewness | 0 |
| Sum | 323533203 |
| Variance | 53922200.5 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 16968 | 1 | < 0.1% |
| 16966 | 1 | < 0.1% |
| 16965 | 1 | < 0.1% |
| 16964 | 1 | < 0.1% |
| 16963 | 1 | < 0.1% |
| 16962 | 1 | < 0.1% |
| 16961 | 1 | < 0.1% |
| 16960 | 1 | < 0.1% |
| 16959 | 1 | < 0.1% |
| Other values (25427) | 25427 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 |
| Value | Count | Frequency (%) |
| 25437 | 1 | |
| 25436 | 1 | |
| 25435 | 1 | |
| 25434 | 1 | |
| 25433 | 1 |
tournament_id
Text
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 178059 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WC-1970 |
|---|---|
| 2nd row | WC-1970 |
| 3rd row | WC-1970 |
| 4th row | WC-1970 |
| 5th row | WC-1970 |
| Value | Count | Frequency (%) |
| wc-2018 | 1790 | 7.0% |
| wc-2014 | 1781 | 7.0% |
| wc-2006 | 1774 | 7.0% |
| wc-2010 | 1763 | 6.9% |
| wc-2002 | 1756 | 6.9% |
| wc-1998 | 1745 | 6.9% |
| wc-2019 | 1447 | 5.7% |
| wc-2015 | 1430 | 5.6% |
| wc-1994 | 1343 | 5.3% |
| wc-1990 | 1334 | 5.2% |
| Other values (10) | 9274 |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 25437 | |
| C | 25437 | |
| - | 25437 | |
| 0 | 23495 | |
| 1 | 21687 | |
| 9 | 18330 | |
| 2 | 17391 | |
| 8 | 7128 | 4.0% |
| 4 | 4060 | 2.3% |
| 7 | 3561 | 2.0% |
| Other values (3) | 6096 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 101748 | |
| Uppercase Letter | 50874 | |
| Dash Punctuation | 25437 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 23495 | |
| 1 | 21687 | |
| 9 | 18330 | |
| 2 | 17391 | |
| 8 | 7128 | 7.0% |
| 4 | 4060 | 4.0% |
| 7 | 3561 | 3.5% |
| 6 | 3107 | 3.1% |
| 5 | 2121 | 2.1% |
| 3 | 868 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 25437 | |
| C | 25437 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 127185 | |
| Latin | 50874 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 25437 | |
| 0 | 23495 | |
| 1 | 21687 | |
| 9 | 18330 | |
| 2 | 17391 | |
| 8 | 7128 | 5.6% |
| 4 | 4060 | 3.2% |
| 7 | 3561 | 2.8% |
| 6 | 3107 | 2.4% |
| 5 | 2121 | 1.7% |
Latin
| Value | Count | Frequency (%) |
| W | 25437 | |
| C | 25437 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178059 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| W | 25437 | |
| C | 25437 | |
| - | 25437 | |
| 0 | 23495 | |
| 1 | 21687 | |
| 9 | 18330 | |
| 2 | 17391 | |
| 8 | 7128 | 4.0% |
| 4 | 4060 | 2.3% |
| 7 | 3561 | 2.0% |
| Other values (3) | 6096 | 3.4% |
tournament_name
Text
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 25 |
| Mean length | 25.535755 |
| Min length | 25 |
Characters and Unicode
| Total characters | 649553 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1970 FIFA Men's World Cup |
|---|---|
| 2nd row | 1970 FIFA Men's World Cup |
| 3rd row | 1970 FIFA Men's World Cup |
| 4th row | 1970 FIFA Men's World Cup |
| 5th row | 1970 FIFA Men's World Cup |
| Value | Count | Frequency (%) |
| fifa | 25437 | |
| world | 25437 | |
| cup | 25437 | |
| men's | 18623 | |
| women's | 6814 | 5.4% |
| 2018 | 1790 | 1.4% |
| 2014 | 1781 | 1.4% |
| 2006 | 1774 | 1.4% |
| 2010 | 1763 | 1.4% |
| 2002 | 1756 | 1.4% |
| Other values (15) | 16573 |
Most occurring characters
| Value | Count | Frequency (%) |
| 101748 | 15.7% | |
| F | 50874 | 7.8% |
| o | 32251 | 5.0% |
| W | 32251 | 5.0% |
| ' | 25437 | 3.9% |
| C | 25437 | 3.9% |
| d | 25437 | 3.9% |
| l | 25437 | 3.9% |
| r | 25437 | 3.9% |
| s | 25437 | 3.9% |
| Other values (18) | 279807 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 242561 | |
| Uppercase Letter | 178059 | |
| Space Separator | 101748 | |
| Decimal Number | 101748 | |
| Other Punctuation | 25437 | 3.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 32251 | |
| d | 25437 | |
| l | 25437 | |
| r | 25437 | |
| s | 25437 | |
| n | 25437 | |
| p | 25437 | |
| e | 25437 | |
| u | 25437 | |
| m | 6814 | 2.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 23495 | |
| 1 | 21687 | |
| 9 | 18330 | |
| 2 | 17391 | |
| 8 | 7128 | 7.0% |
| 4 | 4060 | 4.0% |
| 7 | 3561 | 3.5% |
| 6 | 3107 | 3.1% |
| 5 | 2121 | 2.1% |
| 3 | 868 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 50874 | |
| W | 32251 | |
| C | 25437 | |
| A | 25437 | |
| I | 25437 | |
| M | 18623 | 10.5% |
Space Separator
| Value | Count | Frequency (%) |
| 101748 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 25437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 420620 | |
| Common | 228933 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 50874 | 12.1% |
| o | 32251 | 7.7% |
| W | 32251 | 7.7% |
| C | 25437 | 6.0% |
| d | 25437 | 6.0% |
| l | 25437 | 6.0% |
| r | 25437 | 6.0% |
| s | 25437 | 6.0% |
| n | 25437 | 6.0% |
| p | 25437 | 6.0% |
| Other values (6) | 127185 |
Common
| Value | Count | Frequency (%) |
| 101748 | ||
| ' | 25437 | 11.1% |
| 0 | 23495 | 10.3% |
| 1 | 21687 | 9.5% |
| 9 | 18330 | 8.0% |
| 2 | 17391 | 7.6% |
| 8 | 7128 | 3.1% |
| 4 | 4060 | 1.8% |
| 7 | 3561 | 1.6% |
| 6 | 3107 | 1.4% |
| Other values (2) | 2989 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 649553 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 101748 | 15.7% | |
| F | 50874 | 7.8% |
| o | 32251 | 5.0% |
| W | 32251 | 5.0% |
| ' | 25437 | 3.9% |
| C | 25437 | 3.9% |
| d | 25437 | 3.9% |
| l | 25437 | 3.9% |
| r | 25437 | 3.9% |
| s | 25437 | 3.9% |
| Other values (18) | 279807 |
match_id
Text
| Distinct | 951 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 228933 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M-1970-01 |
|---|---|
| 2nd row | M-1970-01 |
| 3rd row | M-1970-01 |
| 4th row | M-1970-01 |
| 5th row | M-1970-01 |
| Value | Count | Frequency (%) |
| m-2018-56 | 30 | 0.1% |
| m-2018-60 | 30 | 0.1% |
| m-2018-51 | 30 | 0.1% |
| m-2018-62 | 30 | 0.1% |
| m-2018-52 | 30 | 0.1% |
| m-2019-40 | 30 | 0.1% |
| m-2019-38 | 29 | 0.1% |
| m-2015-37 | 28 | 0.1% |
| m-2018-01 | 28 | 0.1% |
| m-2015-52 | 28 | 0.1% |
| Other values (941) | 25144 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 50874 | |
| 0 | 30626 | |
| 1 | 29902 | |
| M | 25437 | |
| 2 | 25423 | |
| 9 | 20683 | |
| 4 | 9910 | 4.3% |
| 8 | 9544 | 4.2% |
| 3 | 7468 | 3.3% |
| 5 | 6701 | 2.9% |
| Other values (2) | 12365 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 152622 | |
| Dash Punctuation | 50874 | 22.2% |
| Uppercase Letter | 25437 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 30626 | |
| 1 | 29902 | |
| 2 | 25423 | |
| 9 | 20683 | |
| 4 | 9910 | 6.5% |
| 8 | 9544 | 6.3% |
| 3 | 7468 | 4.9% |
| 5 | 6701 | 4.4% |
| 6 | 6402 | 4.2% |
| 7 | 5963 | 3.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 50874 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 25437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 203496 | |
| Latin | 25437 | 11.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 50874 | |
| 0 | 30626 | |
| 1 | 29902 | |
| 2 | 25423 | |
| 9 | 20683 | |
| 4 | 9910 | 4.9% |
| 8 | 9544 | 4.7% |
| 3 | 7468 | 3.7% |
| 5 | 6701 | 3.3% |
| 6 | 6402 | 3.1% |
Latin
| Value | Count | Frequency (%) |
| M | 25437 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 228933 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 50874 | |
| 0 | 30626 | |
| 1 | 29902 | |
| M | 25437 | |
| 2 | 25423 | |
| 9 | 20683 | |
| 4 | 9910 | 4.3% |
| 8 | 9544 | 4.2% |
| 3 | 7468 | 3.3% |
| 5 | 6701 | 2.9% |
| Other values (2) | 12365 | 5.4% |
match_name
Text
| Distinct | 748 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 32 |
| Mean length | 19.61351574 |
| Min length | 12 |
Characters and Unicode
| Total characters | 498909 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mexico vs Soviet Union |
|---|---|
| 2nd row | Mexico vs Soviet Union |
| 3rd row | Mexico vs Soviet Union |
| 4th row | Mexico vs Soviet Union |
| 5th row | Mexico vs Soviet Union |
| Value | Count | Frequency (%) |
| vs | 25437 | |
| germany | 3353 | 3.9% |
| brazil | 2770 | 3.2% |
| england | 2012 | 2.4% |
| argentina | 1960 | 2.3% |
| united | 1933 | 2.3% |
| italy | 1893 | 2.2% |
| sweden | 1879 | 2.2% |
| states | 1856 | 2.2% |
| france | 1841 | 2.2% |
| Other values (91) | 40488 |
Most occurring characters
| Value | Count | Frequency (%) |
| 59985 | 12.0% | |
| a | 55572 | 11.1% |
| s | 35122 | 7.0% |
| e | 33564 | 6.7% |
| n | 32727 | 6.6% |
| r | 28902 | 5.8% |
| v | 27499 | 5.5% |
| i | 26342 | 5.3% |
| t | 20064 | 4.0% |
| l | 18871 | 3.8% |
| Other values (37) | 160261 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 379528 | |
| Space Separator | 59985 | 12.0% |
| Uppercase Letter | 59396 | 11.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 55572 | |
| s | 35122 | |
| e | 33564 | |
| n | 32727 | 8.6% |
| r | 28902 | 7.6% |
| v | 27499 | 7.2% |
| i | 26342 | 6.9% |
| t | 20064 | 5.3% |
| l | 18871 | 5.0% |
| o | 18662 | 4.9% |
| Other values (15) | 82203 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 9611 | |
| C | 5659 | |
| N | 4943 | 8.3% |
| A | 4742 | 8.0% |
| B | 4554 | 7.7% |
| G | 4209 | 7.1% |
| U | 3434 | 5.8% |
| I | 3406 | 5.7% |
| E | 2992 | 5.0% |
| P | 2562 | 4.3% |
| Other values (11) | 13284 |
Space Separator
| Value | Count | Frequency (%) |
| 59985 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 438924 | |
| Common | 59985 | 12.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 55572 | 12.7% |
| s | 35122 | 8.0% |
| e | 33564 | 7.6% |
| n | 32727 | 7.5% |
| r | 28902 | 6.6% |
| v | 27499 | 6.3% |
| i | 26342 | 6.0% |
| t | 20064 | 4.6% |
| l | 18871 | 4.3% |
| o | 18662 | 4.3% |
| Other values (36) | 141599 |
Common
| Value | Count | Frequency (%) |
| 59985 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 498909 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 59985 | 12.0% | |
| a | 55572 | 11.1% |
| s | 35122 | 7.0% |
| e | 33564 | 6.7% |
| n | 32727 | 6.6% |
| r | 28902 | 5.8% |
| v | 27499 | 5.5% |
| i | 26342 | 5.3% |
| t | 20064 | 4.0% |
| l | 18871 | 3.8% |
| Other values (37) | 160261 |
match_date
Text
| Distinct | 380 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 254370 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1970-05-31 |
|---|---|
| 2nd row | 1970-05-31 |
| 3rd row | 1970-05-31 |
| 4th row | 1970-05-31 |
| 5th row | 1970-05-31 |
| Value | Count | Frequency (%) |
| 1991-11-19 | 154 | 0.6% |
| 1991-11-21 | 154 | 0.6% |
| 1991-11-17 | 128 | 0.5% |
| 2018-06-16 | 112 | 0.4% |
| 2006-06-22 | 112 | 0.4% |
| 2018-06-26 | 112 | 0.4% |
| 2014-06-23 | 112 | 0.4% |
| 2018-06-27 | 112 | 0.4% |
| 2007-09-15 | 112 | 0.4% |
| 2014-06-24 | 112 | 0.4% |
| Other values (370) | 24217 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 56695 | |
| - | 50874 | |
| 1 | 36387 | |
| 2 | 28605 | |
| 6 | 25547 | |
| 9 | 21941 | 8.6% |
| 8 | 9459 | 3.7% |
| 7 | 9154 | 3.6% |
| 4 | 6592 | 2.6% |
| 5 | 4846 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 203496 | |
| Dash Punctuation | 50874 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 56695 | |
| 1 | 36387 | |
| 2 | 28605 | |
| 6 | 25547 | |
| 9 | 21941 | 10.8% |
| 8 | 9459 | 4.6% |
| 7 | 9154 | 4.5% |
| 4 | 6592 | 3.2% |
| 5 | 4846 | 2.4% |
| 3 | 4270 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 50874 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 254370 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 56695 | |
| - | 50874 | |
| 1 | 36387 | |
| 2 | 28605 | |
| 6 | 25547 | |
| 9 | 21941 | 8.6% |
| 8 | 9459 | 3.7% |
| 7 | 9154 | 3.6% |
| 4 | 6592 | 2.6% |
| 5 | 4846 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 254370 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 56695 | |
| - | 50874 | |
| 1 | 36387 | |
| 2 | 28605 | |
| 6 | 25547 | |
| 9 | 21941 | 8.6% |
| 8 | 9459 | 3.7% |
| 7 | 9154 | 3.6% |
| 4 | 6592 | 2.6% |
| 5 | 4846 | 1.9% |
stage_name
Text
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 11 |
| Mean length | 11.42776271 |
| Min length | 5 |
Characters and Unicode
| Total characters | 290688 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | group stage |
|---|---|
| 2nd row | group stage |
| 3rd row | group stage |
| 4th row | group stage |
| 5th row | group stage |
| Value | Count | Frequency (%) |
| group | 19254 | |
| stage | 19254 | |
| round | 2379 | 4.7% |
| of | 2379 | 4.7% |
| 16 | 2379 | 4.7% |
| quarter-finals | 1067 | 2.1% |
| second | 902 | 1.8% |
| quarter-final | 757 | 1.5% |
| semi-finals | 584 | 1.1% |
| third-place | 534 | 1.0% |
| Other values (3) | 1396 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| g | 38508 | |
| r | 25815 | |
| 25448 | ||
| a | 25416 | |
| o | 24914 | |
| e | 23462 | |
| u | 23457 | |
| s | 22755 | |
| t | 22146 | |
| p | 19788 | |
| Other values (12) | 38979 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 257176 | |
| Space Separator | 25448 | 8.8% |
| Decimal Number | 4758 | 1.6% |
| Dash Punctuation | 3306 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| g | 38508 | |
| r | 25815 | |
| a | 25416 | |
| o | 24914 | |
| e | 23462 | |
| u | 23457 | |
| s | 22755 | |
| t | 22146 | |
| p | 19788 | |
| n | 6551 | 2.5% |
| Other values (8) | 24364 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2379 | |
| 1 | 2379 |
Space Separator
| Value | Count | Frequency (%) |
| 25448 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3306 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 257176 | |
| Common | 33512 | 11.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| g | 38508 | |
| r | 25815 | |
| a | 25416 | |
| o | 24914 | |
| e | 23462 | |
| u | 23457 | |
| s | 22755 | |
| t | 22146 | |
| p | 19788 | |
| n | 6551 | 2.5% |
| Other values (8) | 24364 |
Common
| Value | Count | Frequency (%) |
| 25448 | ||
| - | 3306 | 9.9% |
| 6 | 2379 | 7.1% |
| 1 | 2379 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 290688 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| g | 38508 | |
| r | 25815 | |
| 25448 | ||
| a | 25416 | |
| o | 24914 | |
| e | 23462 | |
| u | 23457 | |
| s | 22755 | |
| t | 22146 | |
| p | 19788 | |
| Other values (12) | 38979 |
group_name
Text
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 7 |
| Mean length | 8.701497818 |
| Min length | 7 |
Characters and Unicode
| Total characters | 221340 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Group 1 |
|---|---|
| 2nd row | Group 1 |
| 3rd row | Group 1 |
| 4th row | Group 1 |
| 5th row | Group 1 |
| Value | Count | Frequency (%) |
| group | 19254 | |
| not | 6183 | 12.2% |
| applicable | 6183 | 12.2% |
| b | 2901 | 5.7% |
| a | 2886 | 5.7% |
| c | 2609 | 5.1% |
| d | 2284 | 4.5% |
| e | 1797 | 3.5% |
| f | 1786 | 3.5% |
| h | 1000 | 2.0% |
| Other values (7) | 3991 | 7.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 31620 | |
| o | 25437 | |
| 25437 | ||
| G | 20244 | |
| u | 19254 | |
| r | 19254 | |
| a | 12366 | 5.6% |
| l | 12366 | 5.6% |
| c | 6211 | 2.8% |
| e | 6183 | 2.8% |
| Other values (17) | 42968 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 157423 | |
| Uppercase Letter | 35479 | 16.0% |
| Space Separator | 25437 | 11.5% |
| Decimal Number | 3001 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 31620 | |
| o | 25437 | |
| u | 19254 | |
| r | 19254 | |
| a | 12366 | 7.9% |
| l | 12366 | 7.9% |
| c | 6211 | 3.9% |
| e | 6183 | 3.9% |
| n | 6183 | 3.9% |
| b | 6183 | 3.9% |
| Other values (2) | 12366 | 7.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 20244 | |
| B | 2901 | 8.2% |
| A | 2886 | 8.1% |
| C | 2581 | 7.3% |
| D | 2284 | 6.4% |
| E | 1797 | 5.1% |
| F | 1786 | 5.0% |
| H | 1000 | 2.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 682 | |
| 3 | 681 | |
| 1 | 675 | |
| 2 | 663 | |
| 6 | 151 | 5.0% |
| 5 | 149 | 5.0% |
Space Separator
| Value | Count | Frequency (%) |
| 25437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 192902 | |
| Common | 28438 | 12.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| p | 31620 | |
| o | 25437 | |
| G | 20244 | |
| u | 19254 | |
| r | 19254 | |
| a | 12366 | 6.4% |
| l | 12366 | 6.4% |
| c | 6211 | 3.2% |
| e | 6183 | 3.2% |
| n | 6183 | 3.2% |
| Other values (10) | 33784 |
Common
| Value | Count | Frequency (%) |
| 25437 | ||
| 4 | 682 | 2.4% |
| 3 | 681 | 2.4% |
| 1 | 675 | 2.4% |
| 2 | 663 | 2.3% |
| 6 | 151 | 0.5% |
| 5 | 149 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 221340 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| p | 31620 | |
| o | 25437 | |
| 25437 | ||
| G | 20244 | |
| u | 19254 | |
| r | 19254 | |
| a | 12366 | 5.6% |
| l | 12366 | 5.6% |
| c | 6211 | 2.8% |
| e | 6183 | 2.8% |
| Other values (17) | 42968 |
team_id
Text
| Distinct | 84 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 101748 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | T-46 |
|---|---|
| 2nd row | T-46 |
| 3rd row | T-46 |
| 4th row | T-46 |
| 5th row | T-46 |
| Value | Count | Frequency (%) |
| t-09 | 1375 | 5.4% |
| t-31 | 1103 | 4.3% |
| t-28 | 1010 | 4.0% |
| t-03 | 984 | 3.9% |
| t-41 | 948 | 3.7% |
| t-74 | 944 | 3.7% |
| t-83 | 934 | 3.7% |
| t-30 | 923 | 3.6% |
| t-48 | 780 | 3.1% |
| t-73 | 746 | 2.9% |
| Other values (74) | 15690 |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 25437 | |
| - | 25437 | |
| 4 | 7306 | 7.2% |
| 3 | 6904 | 6.8% |
| 1 | 6546 | 6.4% |
| 0 | 6226 | 6.1% |
| 8 | 5200 | 5.1% |
| 7 | 4715 | 4.6% |
| 6 | 4067 | 4.0% |
| 5 | 3979 | 3.9% |
| Other values (2) | 5931 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 50874 | |
| Uppercase Letter | 25437 | |
| Dash Punctuation | 25437 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 7306 | |
| 3 | 6904 | |
| 1 | 6546 | |
| 0 | 6226 | |
| 8 | 5200 | |
| 7 | 4715 | |
| 6 | 4067 | |
| 5 | 3979 | |
| 2 | 3907 | |
| 9 | 2024 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 25437 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 76311 | |
| Latin | 25437 | 25.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 25437 | |
| 4 | 7306 | 9.6% |
| 3 | 6904 | 9.0% |
| 1 | 6546 | 8.6% |
| 0 | 6226 | 8.2% |
| 8 | 5200 | 6.8% |
| 7 | 4715 | 6.2% |
| 6 | 4067 | 5.3% |
| 5 | 3979 | 5.2% |
| 2 | 3907 | 5.1% |
Latin
| Value | Count | Frequency (%) |
| T | 25437 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 101748 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 25437 | |
| - | 25437 | |
| 4 | 7306 | 7.2% |
| 3 | 6904 | 6.8% |
| 1 | 6546 | 6.4% |
| 0 | 6226 | 6.1% |
| 8 | 5200 | 5.1% |
| 7 | 4715 | 4.6% |
| 6 | 4067 | 4.0% |
| 5 | 3979 | 3.9% |
| Other values (2) | 5931 | 5.8% |
team_name
Text
| Distinct | 84 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 20 |
| Mean length | 7.809254236 |
| Min length | 4 |
Characters and Unicode
| Total characters | 198644 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mexico |
|---|---|
| 2nd row | Mexico |
| 3rd row | Mexico |
| 4th row | Mexico |
| 5th row | Mexico |
| Value | Count | Frequency (%) |
| germany | 1679 | 5.6% |
| brazil | 1375 | 4.6% |
| england | 1010 | 3.4% |
| argentina | 984 | 3.3% |
| united | 973 | 3.2% |
| italy | 948 | 3.2% |
| sweden | 944 | 3.1% |
| states | 934 | 3.1% |
| france | 923 | 3.1% |
| netherlands | 780 | 2.6% |
| Other values (90) | 19448 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 27779 | |
| e | 16798 | 8.5% |
| n | 16387 | 8.2% |
| r | 14443 | 7.3% |
| i | 13162 | 6.6% |
| t | 10056 | 5.1% |
| l | 9429 | 4.7% |
| o | 9328 | 4.7% |
| d | 6559 | 3.3% |
| u | 5810 | 2.9% |
| Other values (37) | 68893 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 164379 | |
| Uppercase Letter | 29704 | 15.0% |
| Space Separator | 4561 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 27779 | |
| e | 16798 | |
| n | 16387 | |
| r | 14443 | |
| i | 13162 | 8.0% |
| t | 10056 | 6.1% |
| l | 9429 | 5.7% |
| o | 9328 | 5.7% |
| d | 6559 | 4.0% |
| u | 5810 | 3.5% |
| Other values (15) | 34628 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4815 | |
| C | 2823 | |
| N | 2467 | 8.3% |
| A | 2371 | 8.0% |
| B | 2266 | 7.6% |
| G | 2105 | 7.1% |
| U | 1729 | 5.8% |
| I | 1703 | 5.7% |
| E | 1502 | 5.1% |
| P | 1284 | 4.3% |
| Other values (11) | 6639 |
Space Separator
| Value | Count | Frequency (%) |
| 4561 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 194083 | |
| Common | 4561 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 27779 | |
| e | 16798 | 8.7% |
| n | 16387 | 8.4% |
| r | 14443 | 7.4% |
| i | 13162 | 6.8% |
| t | 10056 | 5.2% |
| l | 9429 | 4.9% |
| o | 9328 | 4.8% |
| d | 6559 | 3.4% |
| u | 5810 | 3.0% |
| Other values (36) | 64332 |
Common
| Value | Count | Frequency (%) |
| 4561 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 198644 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 27779 | |
| e | 16798 | 8.5% |
| n | 16387 | 8.2% |
| r | 14443 | 7.3% |
| i | 13162 | 6.6% |
| t | 10056 | 5.1% |
| l | 9429 | 4.7% |
| o | 9328 | 4.7% |
| d | 6559 | 3.3% |
| u | 5810 | 2.9% |
| Other values (37) | 68893 |
team_code
Text
| Distinct | 83 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 76311 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MEX |
|---|---|
| 2nd row | MEX |
| 3rd row | MEX |
| 4th row | MEX |
| 5th row | MEX |
| Value | Count | Frequency (%) |
| deu | 1603 | 6.3% |
| bra | 1375 | 5.4% |
| eng | 1010 | 4.0% |
| arg | 984 | 3.9% |
| ita | 948 | 3.7% |
| swe | 944 | 3.7% |
| usa | 934 | 3.7% |
| fra | 923 | 3.6% |
| nld | 780 | 3.1% |
| esp | 746 | 2.9% |
| Other values (73) | 15190 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 8872 | 11.6% |
| A | 8161 | 10.7% |
| E | 6435 | 8.4% |
| N | 6344 | 8.3% |
| U | 5540 | 7.3% |
| S | 4761 | 6.2% |
| G | 3692 | 4.8% |
| C | 3436 | 4.5% |
| D | 3293 | 4.3% |
| L | 3004 | 3.9% |
| Other values (16) | 22773 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 76311 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 8872 | 11.6% |
| A | 8161 | 10.7% |
| E | 6435 | 8.4% |
| N | 6344 | 8.3% |
| U | 5540 | 7.3% |
| S | 4761 | 6.2% |
| G | 3692 | 4.8% |
| C | 3436 | 4.5% |
| D | 3293 | 4.3% |
| L | 3004 | 3.9% |
| Other values (16) | 22773 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 76311 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 8872 | 11.6% |
| A | 8161 | 10.7% |
| E | 6435 | 8.4% |
| N | 6344 | 8.3% |
| U | 5540 | 7.3% |
| S | 4761 | 6.2% |
| G | 3692 | 4.8% |
| C | 3436 | 4.5% |
| D | 3293 | 4.3% |
| L | 3004 | 3.9% |
| Other values (16) | 22773 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 76311 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 8872 | 11.6% |
| A | 8161 | 10.7% |
| E | 6435 | 8.4% |
| N | 6344 | 8.3% |
| U | 5540 | 7.3% |
| S | 4761 | 6.2% |
| G | 3692 | 4.8% |
| C | 3436 | 4.5% |
| D | 3293 | 4.3% |
| L | 3004 | 3.9% |
| Other values (16) | 22773 |
home_team
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5000196564 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 12718 |
| Zeros (%) | 50.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5000098281 |
|---|---|
| Coefficient of variation (CV) | 0.9999803442 |
| Kurtosis | -2.000157264 |
| Mean | 0.5000196564 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -7.863026099 × 10-5 |
| Sum | 12719 |
| Variance | 0.2500098282 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 12719 | |
| 0 | 12718 |
| Value | Count | Frequency (%) |
| 0 | 12718 | |
| 1 | 12719 |
| Value | Count | Frequency (%) |
| 1 | 12719 | |
| 0 | 12718 |
away_team
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4999803436 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 12719 |
| Zeros (%) | 50.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.5000098281 |
|---|---|
| Coefficient of variation (CV) | 1.000058971 |
| Kurtosis | -2.000157264 |
| Mean | 0.4999803436 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.863026099 × 10-5 |
| Sum | 12718 |
| Variance | 0.2500098282 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 12719 | |
| 1 | 12718 |
| Value | Count | Frequency (%) |
| 0 | 12719 | |
| 1 | 12718 |
| Value | Count | Frequency (%) |
| 1 | 12718 | |
| 0 | 12719 |
player_id
Text
| Distinct | 6086 |
|---|---|
| Distinct (%) | 23.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 178059 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 992 ? |
|---|---|
| Unique (%) | 3.9% |
Sample
| 1st row | P-66980 |
|---|---|
| 2nd row | P-64553 |
| 3rd row | P-42664 |
| 4th row | P-69898 |
| 5th row | P-56971 |
| Value | Count | Frequency (%) |
| p-49502 | 25 | 0.1% |
| p-62104 | 24 | 0.1% |
| p-25850 | 24 | 0.1% |
| p-89236 | 24 | 0.1% |
| p-27787 | 24 | 0.1% |
| p-43222 | 23 | 0.1% |
| p-80404 | 21 | 0.1% |
| p-19610 | 21 | 0.1% |
| p-97813 | 21 | 0.1% |
| p-41187 | 21 | 0.1% |
| Other values (6076) | 25209 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 25437 | |
| - | 25437 | |
| 9 | 13343 | |
| 8 | 12930 | |
| 4 | 12888 | |
| 2 | 12885 | |
| 7 | 12868 | |
| 3 | 12781 | |
| 6 | 12761 | |
| 5 | 12382 | |
| Other values (2) | 24347 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 127185 | |
| Uppercase Letter | 25437 | 14.3% |
| Dash Punctuation | 25437 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 13343 | |
| 8 | 12930 | |
| 4 | 12888 | |
| 2 | 12885 | |
| 7 | 12868 | |
| 3 | 12781 | |
| 6 | 12761 | |
| 5 | 12382 | |
| 0 | 12277 | |
| 1 | 12070 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 25437 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 152622 | |
| Latin | 25437 | 14.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 25437 | |
| 9 | 13343 | |
| 8 | 12930 | |
| 4 | 12888 | |
| 2 | 12885 | |
| 7 | 12868 | |
| 3 | 12781 | |
| 6 | 12761 | |
| 5 | 12382 | |
| 0 | 12277 |
Latin
| Value | Count | Frequency (%) |
| P | 25437 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178059 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 25437 | |
| - | 25437 | |
| 9 | 13343 | |
| 8 | 12930 | |
| 4 | 12888 | |
| 2 | 12885 | |
| 7 | 12868 | |
| 3 | 12781 | |
| 6 | 12761 | |
| 5 | 12382 | |
| Other values (2) | 24347 |
family_name
Text
| Distinct | 5068 |
|---|---|
| Distinct (%) | 19.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 6.760506349 |
| Min length | 1 |
Characters and Unicode
| Total characters | 171967 |
|---|---|
| Distinct characters | 118 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 745 ? |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | Calderón |
|---|---|
| 2nd row | Peña |
| 3rd row | Pérez |
| 4th row | Hernández |
| 5th row | Salgado |
| Value | Count | Frequency (%) |
| van | 248 | 0.9% |
| de | 210 | 0.8% |
| kim | 133 | 0.5% |
| lee | 111 | 0.4% |
| rodríguez | 78 | 0.3% |
| silva | 68 | 0.3% |
| müller | 62 | 0.2% |
| der | 57 | 0.2% |
| larsson | 50 | 0.2% |
| sánchez | 48 | 0.2% |
| Other values (5074) | 25313 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 17630 | 10.3% |
| e | 15570 | 9.1% |
| i | 12183 | 7.1% |
| o | 11778 | 6.8% |
| n | 11471 | 6.7% |
| r | 11448 | 6.7% |
| l | 8062 | 4.7% |
| s | 7492 | 4.4% |
| t | 5442 | 3.2% |
| u | 5130 | 3.0% |
| Other values (108) | 65761 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 144143 | |
| Uppercase Letter | 26473 | 15.4% |
| Space Separator | 941 | 0.5% |
| Dash Punctuation | 272 | 0.2% |
| Other Punctuation | 138 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 17630 | |
| e | 15570 | |
| i | 12183 | 8.5% |
| o | 11778 | 8.2% |
| n | 11471 | 8.0% |
| r | 11448 | 7.9% |
| l | 8062 | 5.6% |
| s | 7492 | 5.2% |
| t | 5442 | 3.8% |
| u | 5130 | 3.6% |
| Other values (63) | 37937 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2522 | 9.5% |
| M | 2348 | 8.9% |
| B | 2236 | 8.4% |
| C | 1647 | 6.2% |
| A | 1527 | 5.8% |
| K | 1526 | 5.8% |
| L | 1301 | 4.9% |
| R | 1301 | 4.9% |
| G | 1274 | 4.8% |
| P | 1244 | 4.7% |
| Other values (32) | 9547 |
Space Separator
| Value | Count | Frequency (%) |
| 941 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 272 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 138 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 170616 | |
| Common | 1351 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 17630 | 10.3% |
| e | 15570 | 9.1% |
| i | 12183 | 7.1% |
| o | 11778 | 6.9% |
| n | 11471 | 6.7% |
| r | 11448 | 6.7% |
| l | 8062 | 4.7% |
| s | 7492 | 4.4% |
| t | 5442 | 3.2% |
| u | 5130 | 3.0% |
| Other values (105) | 64410 |
Common
| Value | Count | Frequency (%) |
| 941 | ||
| - | 272 | 20.1% |
| ' | 138 | 10.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 167776 | |
| None | 4191 | 2.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 17630 | 10.5% |
| e | 15570 | 9.3% |
| i | 12183 | 7.3% |
| o | 11778 | 7.0% |
| n | 11471 | 6.8% |
| r | 11448 | 6.8% |
| l | 8062 | 4.8% |
| s | 7492 | 4.5% |
| t | 5442 | 3.2% |
| u | 5130 | 3.1% |
| Other values (45) | 61570 |
None
| Value | Count | Frequency (%) |
| ć | 572 | |
| é | 553 | |
| á | 483 | |
| í | 457 | 10.9% |
| ö | 234 | 5.6% |
| ó | 209 | 5.0% |
| ü | 151 | 3.6% |
| ñ | 134 | 3.2% |
| ø | 122 | 2.9% |
| č | 106 | 2.5% |
| Other values (53) | 1170 |
given_name
Text
| Distinct | 3086 |
|---|---|
| Distinct (%) | 12.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 6.474662893 |
| Min length | 2 |
Characters and Unicode
| Total characters | 164696 |
|---|---|
| Distinct characters | 101 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 350 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | Ignacio |
|---|---|
| 2nd row | Gustavo |
| 3rd row | Mario |
| 4th row | Guillermo |
| 5th row | Horacio López |
| Value | Count | Frequency (%) |
| not | 1430 | 5.2% |
| applicable | 1430 | 5.2% |
| carlos | 213 | 0.8% |
| luis | 182 | 0.7% |
| josé | 179 | 0.7% |
| david | 152 | 0.6% |
| roberto | 150 | 0.5% |
| thomas | 142 | 0.5% |
| john | 131 | 0.5% |
| fernando | 125 | 0.5% |
| Other values (3045) | 23239 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 18360 | 11.1% |
| i | 14289 | 8.7% |
| e | 13594 | 8.3% |
| n | 12962 | 7.9% |
| o | 10987 | 6.7% |
| l | 9835 | 6.0% |
| r | 9732 | 5.9% |
| t | 5561 | 3.4% |
| s | 5187 | 3.1% |
| u | 4134 | 2.5% |
| Other values (91) | 60055 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 137021 | |
| Uppercase Letter | 24742 | 15.0% |
| Space Separator | 1936 | 1.2% |
| Dash Punctuation | 984 | 0.6% |
| Other Punctuation | 13 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 18360 | |
| i | 14289 | |
| e | 13594 | |
| n | 12962 | |
| o | 10987 | 8.0% |
| l | 9835 | 7.2% |
| r | 9732 | 7.1% |
| t | 5561 | 4.1% |
| s | 5187 | 3.8% |
| u | 4134 | 3.0% |
| Other values (47) | 32380 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2411 | 9.7% |
| A | 2278 | 9.2% |
| J | 2273 | 9.2% |
| S | 1719 | 6.9% |
| R | 1505 | 6.1% |
| C | 1397 | 5.6% |
| D | 1237 | 5.0% |
| L | 1205 | 4.9% |
| G | 1096 | 4.4% |
| K | 1014 | 4.1% |
| Other values (31) | 8607 |
Space Separator
| Value | Count | Frequency (%) |
| 1936 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 984 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 13 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 161763 | |
| Common | 2933 | 1.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 18360 | 11.3% |
| i | 14289 | 8.8% |
| e | 13594 | 8.4% |
| n | 12962 | 8.0% |
| o | 10987 | 6.8% |
| l | 9835 | 6.1% |
| r | 9732 | 6.0% |
| t | 5561 | 3.4% |
| s | 5187 | 3.2% |
| u | 4134 | 2.6% |
| Other values (88) | 57122 |
Common
| Value | Count | Frequency (%) |
| 1936 | ||
| - | 984 | |
| ' | 13 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 162401 | |
| None | 2295 | 1.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 18360 | 11.3% |
| i | 14289 | 8.8% |
| e | 13594 | 8.4% |
| n | 12962 | 8.0% |
| o | 10987 | 6.8% |
| l | 9835 | 6.1% |
| r | 9732 | 6.0% |
| t | 5561 | 3.4% |
| s | 5187 | 3.2% |
| u | 4134 | 2.5% |
| Other values (45) | 57760 |
None
| Value | Count | Frequency (%) |
| é | 659 | |
| á | 293 | |
| í | 221 | 9.6% |
| ó | 187 | 8.1% |
| ł | 99 | 4.3% |
| ü | 92 | 4.0% |
| ú | 89 | 3.9% |
| ë | 77 | 3.4% |
| Á | 71 | 3.1% |
| š | 69 | 3.0% |
| Other values (36) | 438 |
shirt_number
Real number (ℝ)
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.29284114 |
| Minimum | 1 |
|---|---|
| Maximum | 23 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 10 |
| Q3 | 15 |
| 95-th percentile | 21 |
| Maximum | 23 |
| Range | 22 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 6.133736605 |
|---|---|
| Coefficient of variation (CV) | 0.5959225954 |
| Kurtosis | -1.042409125 |
| Mean | 10.29284114 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.2655827522 |
| Sum | 261819 |
| Variance | 37.62272474 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 1544 | 6.1% |
| 9 | 1479 | 5.8% |
| 1 | 1444 | 5.7% |
| 11 | 1430 | 5.6% |
| 6 | 1415 | 5.6% |
| 8 | 1412 | 5.6% |
| 7 | 1395 | 5.5% |
| 4 | 1383 | 5.4% |
| 5 | 1374 | 5.4% |
| 3 | 1370 | 5.4% |
| Other values (13) | 11191 |
| Value | Count | Frequency (%) |
| 1 | 1444 | |
| 2 | 1303 | |
| 3 | 1370 | |
| 4 | 1383 | |
| 5 | 1374 |
| Value | Count | Frequency (%) |
| 23 | 277 | 1.1% |
| 22 | 460 | |
| 21 | 690 | |
| 20 | 944 | |
| 19 | 913 |
position_name
Text
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 20 |
|---|---|
| Median length | 16 |
| Mean length | 10.83810984 |
| Min length | 7 |
Characters and Unicode
| Total characters | 275689 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | goal keeper |
|---|---|
| 2nd row | defender |
| 3rd row | defender |
| 4th row | midfielder |
| 5th row | forward |
| Value | Count | Frequency (%) |
| midfielder | 8809 | |
| center | 6018 | |
| forward | 5515 | |
| back | 4369 | |
| defender | 3722 | |
| left | 2085 | 5.4% |
| right | 2083 | 5.4% |
| goal | 1937 | 5.0% |
| keeper | 1937 | 5.0% |
| winger | 859 | 2.2% |
| Other values (6) | 1190 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 50975 | |
| r | 34769 | |
| d | 30931 | |
| i | 21524 | 7.8% |
| f | 20400 | 7.4% |
| 13087 | 4.7% | |
| l | 12831 | 4.7% |
| a | 12747 | 4.6% |
| n | 11563 | 4.2% |
| t | 11197 | 4.1% |
| Other values (11) | 55665 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 262602 | |
| Space Separator | 13087 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 50975 | |
| r | 34769 | |
| d | 30931 | |
| i | 21524 | |
| f | 20400 | |
| l | 12831 | 4.9% |
| a | 12747 | 4.9% |
| n | 11563 | 4.4% |
| t | 11197 | 4.3% |
| c | 10935 | 4.2% |
| Other values (10) | 44730 |
Space Separator
| Value | Count | Frequency (%) |
| 13087 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 262602 | |
| Common | 13087 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 50975 | |
| r | 34769 | |
| d | 30931 | |
| i | 21524 | |
| f | 20400 | |
| l | 12831 | 4.9% |
| a | 12747 | 4.9% |
| n | 11563 | 4.4% |
| t | 11197 | 4.3% |
| c | 10935 | 4.2% |
| Other values (10) | 44730 |
Common
| Value | Count | Frequency (%) |
| 13087 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 275689 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 50975 | |
| r | 34769 | |
| d | 30931 | |
| i | 21524 | 7.8% |
| f | 20400 | 7.4% |
| 13087 | 4.7% | |
| l | 12831 | 4.7% |
| a | 12747 | 4.6% |
| n | 11563 | 4.2% |
| t | 11197 | 4.1% |
| Other values (11) | 55665 |
position_code
Text
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 198.9 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.005778983 |
| Min length | 2 |
Characters and Unicode
| Total characters | 51021 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | GK |
|---|---|
| 2nd row | DF |
| 3rd row | DF |
| 4th row | MF |
| 5th row | FW |
| Value | Count | Frequency (%) |
| mf | 4937 | |
| df | 3722 | |
| fw | 3697 | |
| cb | 2373 | |
| cm | 2137 | |
| gk | 1937 | 7.6% |
| cf | 1508 | 5.9% |
| rb | 927 | 3.6% |
| lb | 922 | 3.6% |
| rm | 502 | 2.0% |
| Other values (11) | 2775 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 14174 | |
| M | 8809 | |
| C | 6018 | |
| W | 4844 | 9.5% |
| B | 4369 | 8.6% |
| D | 3991 | 7.8% |
| L | 2085 | 4.1% |
| R | 2083 | 4.1% |
| G | 1937 | 3.8% |
| K | 1937 | 3.8% |
| Other values (2) | 774 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 51021 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 14174 | |
| M | 8809 | |
| C | 6018 | |
| W | 4844 | 9.5% |
| B | 4369 | 8.6% |
| D | 3991 | 7.8% |
| L | 2085 | 4.1% |
| R | 2083 | 4.1% |
| G | 1937 | 3.8% |
| K | 1937 | 3.8% |
| Other values (2) | 774 | 1.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51021 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 14174 | |
| M | 8809 | |
| C | 6018 | |
| W | 4844 | 9.5% |
| B | 4369 | 8.6% |
| D | 3991 | 7.8% |
| L | 2085 | 4.1% |
| R | 2083 | 4.1% |
| G | 1937 | 3.8% |
| K | 1937 | 3.8% |
| Other values (2) | 774 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51021 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 14174 | |
| M | 8809 | |
| C | 6018 | |
| W | 4844 | 9.5% |
| B | 4369 | 8.6% |
| D | 3991 | 7.8% |
| L | 2085 | 4.1% |
| R | 2083 | 4.1% |
| G | 1937 | 3.8% |
| K | 1937 | 3.8% |
| Other values (2) | 774 | 1.5% |
starter
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8225026536 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 4515 |
| Zeros (%) | 17.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3820965559 |
|---|---|
| Coefficient of variation (CV) | 0.4645535843 |
| Kurtosis | 0.8500915476 |
| Mean | 0.8225026536 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.688201622 |
| Sum | 20922 |
| Variance | 0.145997778 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 20922 | |
| 0 | 4515 | 17.7% |
| Value | Count | Frequency (%) |
| 0 | 4515 | 17.7% |
| 1 | 20922 |
| Value | Count | Frequency (%) |
| 1 | 20922 | |
| 0 | 4515 | 17.7% |
substitute
Real number (ℝ)
ZEROS 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1774973464 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 20922 |
| Zeros (%) | 82.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 99.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3820965559 |
|---|---|
| Coefficient of variation (CV) | 2.152688835 |
| Kurtosis | 0.8500915476 |
| Mean | 0.1774973464 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.688201622 |
| Sum | 4515 |
| Variance | 0.145997778 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 20922 | |
| 1 | 4515 | 17.7% |
| Value | Count | Frequency (%) |
| 0 | 20922 | |
| 1 | 4515 | 17.7% |
| Value | Count | Frequency (%) |
| 1 | 4515 | 17.7% |
| 0 | 20922 |